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Abstract 

The precision measurements envisaged at the International Linear Colhder (ILC) 
depend on excellent instrumentation and reconstruction software. The correct iden- 
tification of heavy flavour jets, placing unprecedented requirements on the quality of 
the vertex detector, will be central for the ILC programme. This paper describes the 
LCFIVertex software, which provides tools for vertex finding and for identification 
of the flavour and charge of the leading hadron in heavy flavour jets. These tools 
are essential for the ongoing optimisation of the vertex detector design for linear 
colliders such as the ILC. The paper describes the algorithms implemented in the 
LCFIVertex package, as well as the scope of the code and its performance for a 
typical vertex detector design. 

Key words: Simulation, Reconstruction, Monte Carlo, Software tools. Vertex 
detectors, Linear collider 
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1 Introduction 

1.1 Purpose and scope of the LCFIVertex package 

The International Linear Collider (ILC), colliding electrons and positrons, 
is envisaged by the particle physics community to be the next high energy 
accelerator for particle physics research. Its centre of mass energy is planned 
to initially range from 200-500 GeV for physics runs and down to 91 GeV for 
cahbration purposes, with the possibility of a later upgrade to 1 TeV [1]. The 
ILC with its well-known momentum and spin state of the interacting parti- 
cles will be complementary to the Large Hadron Collider (LHC), providing 
capability of precise measurements of new physics phenomena and indirect 
studies of phenomena at energy scales well beyond the direct energy reach of 
both the ILC and LHC [2] . Overviews of the physics accessible at the ILC are 
given in the TESLA Technical Design Report [2j and in the ILC Reference 
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Design Report [T]. The complementarity of the ILC and the LHC has been 
investigated in detail by the LHC/ILC Study Group 1^. 

Extraction of new physics accessible at the ILC will rely not only on high 
quality of the colliding beams, but also on the use of hermetic, high-precision 
detector systems to record the signals of the products of the collisions, as 
well as excellent reconstruction software to analyse these events. Since ILC 
physics is expected to be rich in final states with heavy flavour jets, it will be 
important to be able to distinguish 6 jets, for which the leading hadron contains 
a bottom valence quark, from c jets containing hadrons with a charm valence 
quark and jets arising from light {u, d, s) quark hadronisation. Crucial for 
this "flavour tag" is the high precision measurement of the tracks of charged 
particles in the innermost detector system, the vertex detector, permitting 
reconstruction of the decay vertices of heavy flavour hadrons. As shown at 
previous experiments, a sufficiently precise and mechanically stable detector 
permits the reconstruction of both the primary vertex at the point where 
the particle beams collide, and the full decay chain in heavy flavour jets. For 
example, in a typical b jet containing a B hadron decay 5 mm away from 
the interaction point (IP), resulting in a D hadron that decays, e.g., 3mm 
further away, it is often possible to reconstruct all three vertices from the 
tracks in the jet. Jet flavour identification is aided by observables derived from 
these vertices, such as the mass and momentum of the leading hadron that 
decayed. Further, measurement of the vertex charge permits one to determine 
if the heavy fiavour parton is a quark or an antiquark, opening up a range of 
measurements that would otherwise be inaccessible. 

The LCFIVertex package provides software for vertex finding, fiavour tagging 
and vertex charge reconstruction. In the current phase of ILC detector research 
and development, the code is intended for the optimisation of the vertex de- 
tector design. Furthermore, it is currently being used for optimisation of the 
overall ILC detector concepts, which requires fiavour tagging in order to study 
the benchmark physics processes chosen to assess the performance of different 
detector designs. 

This paper describes the functionality provided by the LCFIVertex package, 
as well as the performance achieved for a typical ILC vertex detector design. 
Emphasis is on the algorithms; further technical information can be found 
in the software documentation [4J. The paper is structured as follows: the 
vertex detector design used for the performance examples and the software 
framework to which the LCFIVertex code is interfaced are described in the 
remainder of this section. Section [2] describes the track selection used for the 
different parts of the code; in particular, the algorithms to suppress tracks 
stemming from photon conversions in the tracking volume and from the decay 
of short-lived A and Ks hadrons are described. Section [3] explains the vertex 
finding algorithms. Section H] the fiavour tag and neural net software on which 
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the flavour tagging approach rehes and Section [5] the algorithm for quark 
charge reconstruction. Examples of the resulting performance for a typical 
detector design, using a sample of e"'"e~ — >• 7/Z — gg with q = u, d, s, c, b at 
a centre-of-mass energy ^/s = 91.2 GeV, unless otherwise stated, are presented 
in addition to the algorithms. Section[6]gives a summary of the paper. Software 
versions and parameter settings used to produce the results in this paper are 
described in Appendices A-C. 



1.2 The detector design used for performance evaluation 

The results presented in this paper were obtained with the detector model 
LDCPrime_02Sc This detector design evolved from the earlier TESLA de- 
tector geometry [2] and was implemented in a Geant4-based detector simula- 
tion by the LDC ("Large Detector Concept") study group. It relies mostly 
on a pixel-based silicon vertex detector and a time projection chamber (TPC) 
to provide charged particle tracking. These detectors are located inside a 
solenoid which provides a magnetic field of 3.5 T. Also inside the solenoid are 
fine grained electromagnetic (ECAL) and hadronic (HCAL) calorimeters. Ad- 
ditional tracking and calorimetry is foreseen in the forward region, which is 
particularly important at ILC energies, given that many of the relevant physics 
processes are expected to give rise to multi-jet final states with at least one jet 
in the forward direction. The tracking detector layout of the LDCPrime_02Sc 
detector model is shown in Figure [TJ 

The vertex detector geometry in LDCPrime_02Sc consists of five barrel layers 
of silicon sensors evenly spaced between the inner radius of 15 mm and the 
outer radius of 60 mm. The length of the active area is 250 mm for the outer 
four layers and 100 mm for the innermost layer. The number of sensor staves 
per layer ranges between 10 and 18. Sensors are assumed to be mounted onto 
a carbon fibre support structure, with the combined material budget of sensor 
and support structure corresponding to 0.1 % of a radiation length per layer. 
As carbon fibre is not among the materials available in Geant, the support is 
described as a 0.134 mm thick layer of graphite in the simulation. At the ends 
of the barrel staves, the amount of material is larger due to the electronics 
and mechanical fixtures needed. In the barrel region, the silicon intermediate 
tracker (SIT), consisting of two layers of silicon sensors located at radii 160 mm 
and 270 mm, respectively, helps link the track segments measured in the vertex 
detector to those provided by the TPC. In the detector design considered, the 
TPC drift region has an inner radius of 37.1 cm, an outer radius of 180 cm and 
a half-length of 224.8 cm. In the forward direction, tracking is complemented 
by a silicon strip detector, the forward track detector (FTD), comprised of 7 
disks located at z positions ranging from 235 mm to 1997.5 mm with inner radii 
between 23.8 mm and 162.7 mm and outer radii between 140 mm and 280 mm. 
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Fig. 1. Tracking system layout of the LDCPrime-02Sc detector model used for the 
LCFIVertex performance studies presented in this paper. The vertex detector is 
shown with cryostat and cables as implemented in the detector simulation. The outer 
forward tracking disks FTD^^l are simulated within a support cylinder. The TPC 
endcap is shown on the right. The plot does not cover the entire radial extent of the 
TPC. 

The inner radius of the sihcon-tungsten ECAL is 182.5 cm and its outer radius 
is 201.1cm. The ECAL is surrounded by an iron-scintillator HCAL with an 
inner radius of 202.5 cm. Calorimeter cells are squares with sides of length 
5 mm in the ECAL and 30 mm in the HCAL. In the forward region, an ECAL 
endcap extends from z = 245 cm to 2; = 264 cm, followed by an HCAL from 
z = 294 cm. 



1.3 LCIO data format and the MarliuReco software framework 

The results presented in this paper were obtained with input from the Pythia 
event generator [6]. The detector response was simulated using the Geant4- 
based [7] program MOKKA [8J and the detector model LDCPrime_02Sc, de- 
scribed above. 

The Linear Collider I/O (LCIO) persistency framework [9] permits storage 
of results between the stages of detector simulation, event reconstruction and 
physics analysis, and the exchange of these results between the different soft- 
ware frameworks used for ILC detector optimisation. One of the main roles 
of LCIO is to provide a common data model, in which an Event entity holds 
collections of objects relevant to the different stages of reconstruction, as well 
as some of the Monte Carlo (MC) information from the event generator. 

Reconstruction was performed using tools from the MarlinReco event recon- 
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struction package [TO], based on the particle flow concept and implemented 
within the modular C++ application framework Marlin [11]. The modules are 
called "processors" in this framework and can be conflgured by steering flies 
in xml format. 

Speciflc MarlinReco processors were used to simulate the digitization in the 
different detector subsystems. In the case of the silicon-based detectors, the 
physical processes occurring in the silicon sensors, such as the drift of the 
electrons and holes that give rise to the signals, are described using simple pa- 
rameterisations. Track flnding, including pattern recognition, and track fltting 
was then performed using the FullLDCTracking processor [T^][T3] . 

Calorimeter clusters and tracks are matched using a particle flow algorithm 
(PEA). For this purpose, the PandoraPFA code [2] was used, resulting in 
a collection of ReconstructedParticles. These are the entities in LCIO on 
which analyses are usually based, and which are used, for example, as the input 
to jet flnding algorithms. The LCFIVertex code is run on jets, and hence for 
studies in the context of PFA-oriented detectors, like the one described here, 
requires the PFA code to be run before the jet flnding step. Calorimeter clus- 
ters are required by the PFA algorithm as part of the input, and therefore also 
need to be reconstructed before this algorithm can be run. Via the jet energy, 
calculated in the PFA code, calorimeter clusters are indirectly used for the 
flavour tag. Otherwise, the LCFIVertex code is independent of the calorimeter 
information and only accesses the tracks in the input jets. It was checked that 
when running a different particle flow algorithm available in MarlinReco, Wolf 
jl5] . flavour tagging performance did not change. The Durham fc^-cluster jet 
flnding algorithm, as implemented in the SatoruJetFinder package in Mar- 
linReco [16], was then run, forcing each event into a two-jet topology. Tracks 
from photon conversions. Kg and A decays were identifled by a conversion tag- 
ger that forms part of LCFIVertex, see Section [2TT1 These tracks were excluded 
from the input passed to the LCFIVertex processors. 

A simple processor to determine the event vertex or interaction point (IP) 
from an iterative flt to a subset of all tracks in an event was run. This IP- 
flt processor and the conversion tagger are the only parts of the LCFIVertex 
package that are not jet-based. The resulting event vertex is used both by the 
vertex flnding algorithms described below and for the flavour tag to determine 
a track's point of closest approach to the IP. 



2 Track Selection 

The default flavour tag procedure implemented in the LCFIVertex package, 
as described in more detail in Section HJ uses secondary vertex information 
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whenever available. Different neural networks with separate sets of input vari- 
ables are used depending on whether secondary vertices are found or not. In 
particular, for distinguishing c jets from uds jets, a useful criterion is that 
uds jets do not contain vertices stemming from the decays of heavy flavour 
hadrons. It must therefore be ensured that the vertex finder creates as few 
fake vertices from wrong track combinations as possible. This can partly be 
achieved by appropriate track selection, as discussed in Section 12. 1[ Other 
potential sources of unwanted vertices for the purpose of flavour tagging are 
photon conversions in the detector material and the decay of Ks and A par- 
ticles, which can resemble a heavy flavour decay vertex. As these effects have 
clear signatures, such tracks can easily be identified and suppressed by the 
track selection. 



2. 1 Identification of tracks from photon conversions, Ks and A decays 

A dedicated Marlin processor is used to identify tracks from Ks and A decays 
and from photon conversions. All two-track combinations are considered as 
candidates and are required to pass the following criteria to be identified as 
conversions or Ks/A decays: 

• The constituent tracks must have opposite charge sign. 

• The distance of closest approach between the two track helices must not 
exceed 1 mm. 

• The distance between the point of closest approach and the IP must be larger 
than 1 mm in order to reduce the risk of fake tags consisting of combinations 
of primary vertex tracks. 

• The invariant mass of the two track combination has to be compatible with 
that of the photon, the Ks or the A. 

To check the mass compatibility, the rest mass of the combination is calcu- 
lated using three mass hypotheses, choosing the masses of the decay products 
accordingly: both tracks are assumed to be electrons in the case of conver- 
sions, or pions for the Ks hypothesis, and for the A hypothesis, the track with 
larger momentum is assigned the proton mass and the other the pion mass. 
The resulting rest mass of the combination is considered to be compatible 
with the hypothesis if it differs from the PDG value by not more than 5 MeV 
for conversions and Kaons and by not more than 2 MeV for Lambdas. Table [T] 
lists performance figures for each particle type separately. All objects that 
are not identified as stemming from the above sources are passed on to the 
track selection processor preceding the subsequent steps such as vertex finding 
and flavour tagging. 
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cliiciciicy 


p ui iiy 


photon 


24.6% 


96.3% 


Ks 


72.2% 


67.4% 


A 


69.8% 


62.3% 



Table 1 

Performance of the conversion, Ks and A identification. The efficiency is nor- 
malised to particles where all conversion respectively decay tracks were recon- 
structed in the detector, i.e. to particles that could potentially lead to reconstruc- 
tion of additional vertices. The purity is defined as the fraction of reconstructed 
conversion/Ks/A candidates with a correct combination of tracks. Selection cuts 
for each particle type were chosen to optimise flavour tag performance, not to opti- 
mise standalone performance of the conversion/Ks /A tagger. 

In addition to this core functionality, the conversion tagger can also be run in 
one of two "cheater" modes, which use MC information to identify conversions 
and Ks or A decays and can be used to assess the performance of the realistic 
reconstruction. The two cheater modes differ in the way they treat the case 
that only one of the tracks from a conversion or Ks/A decay has been recon- 
structed, the other track not being within the detector acceptance or being 
lost due to pattern recognition inefficiencies. The more moderate cheater mode 
then does not flag the track that could be reconstructed as resulting from a 
conversion or Ks/A decay, since no realistic algorithm could possibly identify 
such tracks, whereas the more aggressive cheater mode flags all tracks that 
stem from these sources. 

Figure [2] shows the flavour tagging performance in terms of purity vs. efficiency 
for the three tags provided (see Section 14.31 for details) when the conversion 
tagger is run as part of the reconstruction chain (full symbols). Performance 
without any Ks/A identification and with only the conversions that Pando- 
raPFA finds removed, i.e. no conversion finding in the vertex detector, is plot- 
ted for comparison (open symbols). Also shown is the performance obtained 
from the moderate cheater mode (lines). 

2.2 Track selection for IP fit, ZVTOP and flavour tag 

The track selection can be tuned separately for the IP fit, the ZVTOP vertex 
finder and the calculation of the fiavour tag inputs. An overview of the track 
selection cuts for each of these tasks is given in Table [2J In addition to the 
cuts listed in the table, requirements are implemented on the number of hits 
in the tracking subdetectors as follows: if there are at least 20 hits in the TPC 
or at least three hits in the FTD, no hit is required in the vertex detector. If 
there are fewer hits in the TPC or FTD, at least three vertex detector hits 
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Fig. 2. Comparison of tagging with and without conversion tagging and performance 
obtained when using MC information at y/s = 91.2 GeV. The horizontal axis shows 
the efficiency with which true h (c) jets are tagged, and the vertical axis shows the 
fraction of true h (c) jets in the sample of tagged jets. The performance is evaluated 
separately for b tagging, c tagging and c tagging within a sample of b and c jets only 
(be). 



parameter 


IP fit 


vertexing (ZVRES) 


flavour tag 


X^/ndf of track fit 


5 


4 




R-(j) impact parameter do (mm) 


20 


2 


20 


do uncertainty (mm) 




0.007 




z impact parameter zq (mm) 


20 


5 


20 


zq uncertainty (mm) 




0.025 




track pt (GeV) 


0.1 


0.2 


0.1 



Table 2 

Track selection used for IP fit, vertexing and calculation of the flavour tag inputs. 



are needed. 



3 The ZVTOP vertex finding algorithms 

The LCFIVertex package contains a complete re-implementation of the topo- 
logical vertex finder ZVTOP originally developed at the SLD experiment [17j. 
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A modified version of tlie original Fortran code was used in previous Linear 
Collider studies, see e.g. Refs. [Tp] . For the most part, the vertex finding al- 
gorithm in LCFIVertex, as described below in this section, corresponds to the 
original SLD version. Minor improvements of the LCFIVertex algorithm with 
respect to SLD include a Kalman vertex fit and adjustments to allow use of 
ZVTOP in events at centre-of-mass energies above the Z resonance. 

3.1 The ZVRES algorithm 

The ZVTOP vertex finder provides two complementary algorithms which use 
topological information to identify track combinations that are likely to have 
their origin at a common vertex. The first of these, the ZVRES algorithm, 
can be used to find multi-pronged secondary vertices with an arbitrary ge- 
ometrical distribution and hence is most generally applicable provided the 
detector system has a sufficiently high spatial resolution. The object-oriented 
implementation of the code that forms part of the LCFIVertex package is de- 
scribed in detail elsewhere [18]. Compared to the original implementation, the 
new code provides several improvements and adjustments to a collider envi- 
ronment with a variable centre-of-mass energy. The remainder of this section 
outlines the vertexing algorithms and describes these modifications. 

A central idea of the ZVRES algorithm is to describe each track i by a prob- 
ability density function fi{r) in three-dimensional space and to use these to 
define a vertex function V{r) that yields higher values in the vicinity of true 
vertex locations and lower values elsewhere, as well as providing a criterion 
for when two vertex candidates are resolved from each other. 

The track functions have a Gaussian profile in the plane normal to the tra- 
jectory. With p the point of closest approach of track i to space point r*, the 
track function fiif) is defined as: 



with the second term ensuring that V{r) approaches zero in spatial regions 
in which only one track contributes significantly to the first term and where 
hence no vertex should be found. 




where Vj is the position covariance matrix of the track at p. 



In its most basic form, the vertex function is defined as 
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Optionally, further knowledge on where vertices are more likely to be found 
can be used to weight the vertex function, thereby suppressing fake vertices 
and increasing the purity of the vertices found (i.e. the fraction of correctly 
assigned tracks). Knowledge of the IP position can be used to suppress fake 
vertices from tracks passing close by each other in the vicinity of the IP. This 
is accomplished by representing the IP by a contribution 



/o(f) = exp (r - Yjp (f - 



where p is the position of the IP and V/p the covariance matrix describing 
the accuracy with which this position is known. This new term contributes 
to the vertex function in the same way as the Gaussian probability functions 
representing the tracks, hence the vertex function is redefined as 



N „,,2 f2t^ I v^Af 



w 



IP 



=1 



WiPm+EliMr) 



This definition ensures that space points close to the IP are less likely to be 
resolved from each other and that tracks that could otherwise give rise to fake 
vertices are more likely to be assigned to the primary vertex. In the default 
configuration of the code, the IP contribution is given a weight of wjp = 1. 




Fig. 3. Schematic diagram showing the definition of angle a. 

Similarly, it is kinematically favoured for heavy hadrons to decay close to the 
jet axis rather than at large angle from it, which is taken into account by 
weighting the vertex function outside a cylinder of radius 50 /im by the factor 
exp"^"" . In this expression, the angle a is defined as shown in Fig. [3] and 
Ka = kEjc,t with k being an LCFI Vertex parameter which the user can set and 
Ejet the jet energy. The jet energy dependent definition of implemented in 
LCFIVertex takes into account that jets of higher energy are more coUimated. 
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In addition to indicating likely vertex positions, the other main use of the 
vertex function is to provide a key criterion for merging candidate vertices in 
the process of vertex finding: space points fl and are defined to be resolved 
from each other if along the straight line connecting these points the vertex 
function falls below a given fraction Rq of the lower of the values V{fi) and 
V{f2)- Vertices that are not resolved according to this criterion will be referred 
to as unresolved from each other. 

The main challenge of many vertex finders used prior to ZVTOP is the large 
number of track combinations that need to be considered to determine whether 
they form a good vertex. In contrast, the ZVRES algorithm uses a bottom- up 
approach, starting out from all possible two-track combinations and using the 
vertex function as well as the fit to decide which candidates to keep and to 
merge. 

In the initial step, all two-track and optionally all track-IP combinations are 
fitted with a Kalman vertex fit. Of these two-object fits, those with lower 
than a threshold Xo vertex function at the fitted vertex position fVert 
above a threshold Vq are retained. Before beginning the merging, the number 
of remaining two-object fits to be further considered is reduced as follows: for 
each track in turn, all the two-object combinations that contain the track are 
considered. The track is removed from all vertices with V(rvcrt) below 10 % of 
Knax(^Vcrt)j the maximum vertex function value obtained from the fits for the 
track under consideration. The track's two-object fits are sorted with respect 
to their vertex function ^(rvert)- All vertex candidates for which both objects 
have been removed at this stage are discarded. 

The remaining candidate vertices are then merged making further use of the 
resolvability criterion: starting out from the candidate vertex with the highest 
vertex function in the sorted list found in the previous step, a set of unresolved 
vertices is found by iteratively adding other candidates that are unresolved 
from any of the vertices in the set. The resulting set of unresolved candidate 
vertices is then merged to form a new candidate. From the remaining vertices, 
the next seed in the sorted list is picked and the process continues until all 
original candidate vertices have either been absorbed or considered as seeds 
for an unresolved set. Note that for the merging phase, the vertex function is 
not evaluated at the position rVcrt of the original two-object candidate vertex, 
but at the closest local maximum fuAX of the vertex function, in order to 
improve the suppression of fake vertices. 

Following the merging phase, tracks with a high contribution are removed 
from the resulting candidate vertices: iteratively, the track with the highest 
contribution is removed if its x"^ is above a threshold Xtrim- The vertex 
is refitted and this step is repeated until the highest x^ track passes the cut. 
Candidates that, after this procedure, are no longer associated with either at 
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least two tracks or with the combination of the IP object and 0, 1 or more 
tracks, are discarded. 

Remaining ambiguities in the association of tracks to vertices are resolved by 
keeping each track only in the vertex candidate with highest V{fMAx) and 
removing it from all others. The resulting track combinations are fitted to 
yield the final vertices, which are sorted with respect to their distance from 
the IP. 



3.2 The ZVKIN algorithm 

The ZVTOP vertex finder also provides an algorithm, ZVKIN, to address 
a particular category of jets for which the ZVRES algorithm fails, namely 
h jets with two subsequent one-prong decays (i.e. decays in which only one 
charged track is detected), see e.g. Ref. [IH] for a description of the algorithm 
at SLD, and Ref. for further details of the C++ implementation. For cases 
with one multi-prong vertex and one single-prong vertex a recovery algorithm 
can identify the decay track and subsequently add it to the decay chain (see 
Section [4.21 for details). However, if only one charged track can be detected 
from both the B vertex and the D vertex, a different approach is needed. 
The solution chosen in the ZVKIN algorithm is to use additional kinematic 
information by approximating the direction of flight of the B hadron and 
forming a "ghost track" from it. This ghost track, described as a straight line 
with an appropriate circular error ellipse of constant width along the track, is 
added to the set of tracks from which vertices are found. The algorithm consists 
of two main stages: first the ghost track direction and width are found using 
an iterative minimisation approach. In the second stage this ghost track is 
used to constrain the secondary vertex finding. 

The jet axis is chosen to initialise the ghost track direction. Fixed at the 
position of the IP, the ghost track G is swivelled in both 9 and (j) directions, 
until the value 



is minimal, where xj is the of the vertex fit found using track i and G and 
Xoi = Xi{Li = 0), and i runs over all input tracks. The value Lj is the distance 
from the IP to the projection of the vertex onto G, with the sign chosen to 
be positive if the vertex is in the hemisphere defined by the direction of the 
jet axis. Minimisation continues until changes in both 6 and by O.lmrad 
do not yield further improvement. If the initial direction of G is relatively far 
from the true line of flight of the B hadron, then some, or possibly all, of the 
secondary tracks will yield a vertex with G that has a negative value of Lj. 
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The first minimisation stage is designed so that the contribution to xsi from 
tracks with Lj < will tend to push the ghost track G towards the B flight 
path in the minimisation process. The 2xoi term ensures that the Xsi changes 
in a continuous manner at Lj = 0. 

At this point, the width of the ghost track is calculated such that the largest 
value of Xi for the vertex fit of any of the tracks with G is equal to 1. If it 
is above the user-settable minimum width (5G,min required for G, it is used in 
the next step; otherwise the minimum width is used. With the adjusted ghost 
track width, the ghost track direction is further optimised, using the same 
algorithm as before, but this time minimising the quantity 



The width is then adjusted as before. This choice ensures that the correspond- 
ing vertex probability has an approximately fiat distribution between and 1, 
while fake vertices yield values close to 0. The vertex probability is calculated 
from the value of the vertex fit following the standard algorithm described 
in 

The resulting optimised ghost track is then used to find vertices as follows: 
from each track in the jet, a vertex is formed with the ghost track. The IP 
is added to this initial set of vertices. For all possible pairs of vertices from 
this set, the vertex probability is calculated, omitting the ghost track from 
the combinations that contain the IP. The pair that maximises this probabil- 
ity is merged and replaces the original vertices in the list. Combinations with 
vertex probability below a threshold value Py.min are not accepted, while the 
original vertices which formed the combination are separately retained. The 
merging process continues until no further combination yields a vertex prob- 
ability above Pv,min or until there is only one vertex left. Finally, the ghost 
track is removed from all vertices without refitting them, with the possibility 
that secondary vertices contain only one track. 

3.3 Performance of vertex finding for a typical ILC vertex detector design 

All results presented in this section were obtained with the ZVRES algorithm, 
described in Section [XTl which applies to a broader class of jets than ZVKIN. 
A pure sample of jets from the process e"'"e^ 7/Z bb at ^/s = 91.2 GeV 
was used, permitting a direct comparison with earlier studies at that energy. 
Figure 111(a) shows the probability of the secondary vertex fit. This variable 
cross-checks the vertex fitter, a reasonably fiat distribution, as seen in the 
figure, indicating that the fitter performs as expected. For a study of the 
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Fig. 4. (a) Probability corresponding to the value calculated from the vertex fit 
and (b) efficiency for finding a secondary vertex, for a pure sample of h jets. 

dependence of vertex finding on the decay length of the B hadron, a sub- 
sample was selected, consisting only of jets in which a hadron decayed to 
a charged D hadron, either directly or via a short-lived D* resonance (with the 
D* lifetime being so short that it does not travel any measurable distance in 
the detector, but decays essentially at its point of origin). The decay lengths 
of the B and the D hadron which were chosen in the MC simulation were 
determined for each jet and compared to the values reconstructed by the 
ZVRES algorithm. For this comparison the efficiency of finding a secondary 
vertex within a given decay length interval was plotted as a function of the 
true MC B hadron decay length, as shown in Fig. H] (b). 

Using the same sub-sample of jets as in Fig. H] (b), the reconstructed decay 
lengths of the B and D hadron were compared to the corresponding MC decay 
lengths on a jet-by-jet basis. In Fig. O (a), the B decay length comparison is 
shown for jets in which at least two vertices (the primary, or IP vertex, plus 
at least one secondary vertex) were found. This class of jets includes cases in 
which the D hadron decayed so close to the B decay vertex that it could not 
be resolved from it, and for which therefore the reconstructed decay length is 
shifted to larger values compared to the MC B decay length. Short B decay 
lengths, not permitting the B decay vertex to be resolved from the primary 
vertex, also result in deviation of the reconstructed secondary vertex decay 
length from the MC truth value. In Fig. [5] (b), the decay length of the D 
hadron decay is compared to the MC value for jets in which exactly three 
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Fig. 5. Reconstructed decay length vs. MC decay length for (a) B hadron decays in 
b jets in which at least two vertices have been found and (b) D hadron decays in b 
jets with three ZVTOP vertices. 

vertices were reconstructed by ZVRES. As expected for this category of jets, 
the correlation between reconstructed and MC values is better. 



Monte Carlo 
track origin 


Reconstructed t 
Two vertex case 


rack-vertex association 
Three vertex case 




pri sec iso 


pri sec ter iso 


91.2 GeV Primary 
B decay 
D decay 


91.5 1.4 36.2 

6.7 46.7 29.6 

1.8 51.9 34.2 


95.2 3.1 1.9 49.9 
3.1 75.3 10.6 22.8 
1.7 21.6 87.5 27.2 


500 GeV Primary 
B decay 
D decay 


93.7 2.6 35.3 

4.6 47.3 29.8 

1.7 50.1 34.9 


97.4 4.9 4.0 48.5 
1.8 72.3 13.5 24.5 
0.8 22.9 82.5 27.0 



Table 3 

Percentages of tracks assigned to the reconstructed primary, secondary and tertiary 
vertex and of tracks not contained in any vertex (labelled "iso") which originate 
from the IP, the B or the D decay at MC level, for b jets. 



In addition to the precision with which the decay lengths can be reconstructed, 
important for many physics studies, it was investigated to which extent the 

track content of the reconstructed vertices is correct. For each type of recon- 
structed vertex — the primary, the secondary and, if available, the tertiary 
vertex — the average percentage of tracks that originated from the correspond- 
ing MC decay vertex was determined, yielding the purity of the track content 
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Monte Carlo 
track origin 


Reconstructed t 
Xwo vertex case 


rack-vertex association 
Three vertex case 




pri sec iso 


pri sec ter iso 


yi./Ljev r^riniary 
D decay 


yo.y 0.6 M .u 
4.1 93.7 23.0 


OK a 1^ 90 1 77 7 

yo.o ou.o zy.i ii.i 
3.2 69.5 70.9 22.3 


500 GeV Primary 
D decay 


97.5 9.0 73.1 
2.5 91.0 26.9 


96.7 20.2 51 73.6 
3.3 79.3 48.7 26.4 



Table 4 

Percentages of tracks assigned to the reconstructed primary, secondary and tertiary 
vertex and of tracks not contained in any vertex (labelled "iso") which originate 
from the IP or the D decay at MC level, for c jets. Fractions missing from 100% 
are due to h jets arising from gluon splitting. 



Monte Carlo 
track origin 


Reconstructed t 
Two vertex case 


rack-vertex association 
Three vertex case 




pri sec iso 


pri sec ter iso 


91.2 GeV 6 jets 
c jets 


50.2 35.8 13.9 
65.1 27.3 7.6 


39.6 28.9 23.6 8.0 
50.6 22.0 20.5 7.0 


500 GeV b jets 
c jets 


57.6 27.3 15.1 
74.3 17.5 8.3 


52.5 20.6 16.3 10.6 
65.4 13.7 13.0 7.9 



Table 5 

Percentages of tracks assigned to each type of reconstructed vertex or left unassigned, 
for b and c jets. 

of each type of vertex. These purities are given in Table [3] for b jet^jand in 
Table m for c jets, separately for the cases that exactly two and exactly three 
vertices were reconstructed by ZVRES. The percentages of tracks that were 
found to be contained in each vertex category or left unassigned are given in 
Table [5l The study was performed at both ^/s = 91.2 GeV and at the initial 
maximum energy of the ILC, -^i = 500 GeV. The best assignment of tracks to 
vertices, corresponding to the highest purities, was obtained for c jets with two 
reconstructed vertices and for b jets with three reconstructed vertices. This is 
understandable given the fact that if both the b and the subsequent charm 
decay result in multi-prong vertices, this corresponds to a cleaner topology 



^ Note that the normalisation is chosen differently from the earlier SLD table |17j : 
percentages are normalised to the total number of tracks in each type of recon- 
structed vertex, whereas for the former result percentages were given with respect 
to the total number of tracks in each type of MC vertex. 



17 



that can be more easily reconstructed. If a smaller number of vertices is found 
in b jets, this can indicate either that one of the decay vertices is one-pronged 
and thus cannot be found by ZVRES, or that the decay length of one of the 
heavy flavour hadrons is so short that its decay vertex cannot be resolved from 
the preceding vertex in the decay chain, and that, for example, the secondary 
vertex found by ZVRES contains some tracks that actually originated from 
the D decay and some from the B decay. Both effects can result in a misas- 
signment of tracks by ZVRES and hence to a reduced purity for vertices in 
this category of jets. Similarly, the sample of c jets with three reconstructed 
vertices will contain a higher rate of fake vertices than the c jet sample with 
two reconstructed vertices, again corresponding to a higher confusion in the 
track-to-vertex association. 



Comparing the results at 91.2 and at 500 GeV centre-of-mass energy, an in- 
crease in the available energy results in an increase in the number of tracks 
originating from the primary vertex. It is thus expected that the percent- 
age of tracks assigned to the primary vertex by ZVRES increases, as seen 
in Table O This increased multiplicity of IP tracks, in combination with jets 
becoming more collimated at higher energies, makes vertex finding more chal- 
lenging, even though these effects are partly compensated by increasing decay 
lengths. The net effect is an increased confusion in the track-to-vertex as- 
signment, which is most pronounced for the tertiary vertex in three- vertex 
c jets, for which the percentage of IP tracks increases from about 30 % at 
= 91.2 GeV to about 50 % at -y/i = 500 GeV. The relatively large changes 
of these numbers with centre-of-mass energy indicate that these effects will 
need to be studied in more detail in the future; in particular it would be worth 
investigating if performance at higher energy can be improved by adjusting 
the energy dependence of the ZVRES parameters. 



The performance studies of the vertex reconstruction at a centre-of-mass en- 
ergy of 91.2 GeV allow a direct comparison with results obtained at the SLD 
experiment, with vertex detectors VXD2 [T7j and VXDS [21]. The improved 
angular coverage, point resolution and reduced material budget envisaged for 
the ILC vertex detector are expected to result in significant improvements in 
performance over SLD. Indeed, the vertex finding efficiency for the ILC ver- 
tex detector model, shown in Figure HI is clearly improved compared to the 
earlier SLD results, increasing rapidly at low decay lengths and reaching an 
average value of 89 % in the plateau region above decay lengths of 1 mm. In 
comparison, for SLD- VXDS, the plateau was only reached for decay lengths 
of about 2 mm, with the efficiency above that value being about 80 % [22] . 
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4 A flavour tag procedure based on neural networks 



The LCFIVertex software package contains a neural network based jet flavour 
tag modeled closely after an earlier Fortran-based implementation by R. Hawk- 
ings [23j. A distinctive feature of this approach is a separate treatment of jets 
with and without non-IP vertices. Beyond being a full re-implementation of 
this algorithm in C++, the LCFIVertex package features a high degree of flex- 
ibility concerning neural network architecture, choice of input variables and a 
tool to determine the relevance of individual neural network inputs. The actual 
neural network setup and parameters for LCFIVertex are deflned by external 
flies loaded at run time. These flies can be maintained and distributed indepen- 
dently of the LCFIVertex code, allowing the provision of central repositories 
of neural network flies tuned to speciflc detector models and/or centre-of-mass 
energies. 



4-.1 Determination of true jet flavour, hadron and quark charge from MC 



In order to deflne performance measures for flavour tag and vertex/quark 
charge, the true jet flavour and the charge of the leading hadron and of the 
heaviest quark contained in it need to be known for comparison. A dedicated 
part of the LCFIVertex package implements the following algorithm to extract 
this information from the event record of the MC generator that is included in 
the LCIO event: the event is searched for all hadrons containing 6 or c quarks. 
These hadrons are assigned to the reconstructed jet closest in angle, with the 
possibility of assigning more than one heavy hadron per jet. From the hadrons 
assigned to a given jet, the one appearing earliest in the MC decay chain is 
selected and the jet assigned the flavour of the heaviest quark contained in it 
as true jet flavour. 

For two-jet events, the true jet flavour is clearly deflned and easily obtained 
from this procedure. In multi-jet events, where the angular distance between 
jets is smaller and the assignment of tracks to jets sometimes becomes ambigu- 
ous, there may be cases of tracks corresponding to hadrons from the same par- 
ton shower being assigned to different jets, with the concept of a "jet flavour" 
becoming less clearly deflned. However, these are exceptions related to the 
general difficulties of jet finding in such events, and the algorithm described 
above yields good results also for most jets in multi-jet events. 
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4-2 Observables sensitive to jet flavour 

Many of the variables most sensitive to jet flavour are only defined for jets in 
which non-IP vertices have been found. Therefore, different sets of observables 
are used for jets containing one and jets containing more than one vertex. 

In the case that only one vertex — the event vertex — has been found, the 
input jet is searched for the two tracks of highest impact parameter signifi- 
cance^ in the R-cp plane. These are referred to as the most significant and the 
second-most significant track in what follows. For finding these two tracks, 
separate minimum momentum cuts ptrk,NL,min and Ptrk,NL-i,min are applied for 
tracks with hits on all Nl vertex detector layers or with hits on only A^^^ — 1 
layers, respectively. The momenta |j9trk| and impact parameter significances of 
these tracks in the R-(j) and R-z planes are used as input for the flavour tag. 
A single track of high impact parameter significance may indicate that the 
jet under consideration is a charm jet with the leading having decayed to 
a single charged track ("one-prong" decay), which is expected for 40 % of 
all decays [24j. The observables obtained from the second-most significant 
track help distinguish between c and b jets, for which it is more likely that 
two tracks of high impact parameter significance are found, typically with one 
resulting from the decay of the leading hadron and one from the decay of the 
charmed hadron produced in that decay. Fig. [6] shows distributions of the in- 
puts for the most significant and second-most significant tracks, separately for 
b, c and light flavour jets. It is worth noting that a small positive tail of im- 
pact parameter significances is observed in uds jets, where the largest positive 
impact parameter significance is occasionally contributed by a single misre- 
constructed track in the jet. These tracks lead to an important background 
for the identification of one-prong charm decays. 

Further information is contained in the "joint probability" for all tracks to 
originate from the primary vertex, as introduced by ALEPH [25]. The first 
implementation of this variable for an ILC detector is described elsewhere 
[23] . Two joint probability variables are calculated from the impact parameter 
significances in R-(j) and in R-z of all the tracks in the jet that pass the 
specific selection criteria as detailed below. The distribution /(x) of unsigned 
impact parameter significances for IP tracks is assumed to be known; it can 
be determined from the data, as described in Appendix \^ The probability of 
an IP track having an impact parameter significance of or larger is given 



^ The impact parameter of a track is defined as the distance between the track's 
point of closest approach to the IP and the IP. The impact parameter significance 
is the impact parameter divided by its uncertainty. 
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most signif. trk, log|p | / GeV 2nd most signif. trk, log|p | / GeV 



Fig. 6. Flavour tag inputs based on the most [plots (a), (c), (e)] and second-most 
[plots (h), (d), (f)] significant track in the jet. Shown are the impact parameter 
significance in R-(j) in (a), (h), the impact parameter significance in R-z in (c), (d) 
and the track momentum in (e), (f). 
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lo^ fix)dx • 

For a set of tracks the probability that all tracks originate from the IP 
is 

Pj = y2^ — — ' y = [[Pi ■ 



k=0 



The joint probability is the observable Pj, calculated for the set of tracks that 
pass the track selection cuts described in Section [2121 as well as an upper cut on 
impact parameter of 5 mm and on impact parameter significance of 200. It is 
calculated separately for the R-(f) and the R-z impact parameter significances. 
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Fig. 7. Joint probability for all tracks in the jet passing the track selection cuts to 
originate from the primary vertex. This probability is calculated separately from (a) 
the R-4) impact parameter significances and (b) the impact parameter significances 
in R-z. 

As can be seen from the resulting Pj distributions shown in Fig. [TJ light quark 
jets tend to have values closer to 1, while the distributions for h and c jets 
peak at zero. 

In the case that more than one vertex is found, observables derived from these 
additional vertices provide a more powerful means to distinguish between 6, c 
and light quark jets. The following set of eight variables is used in that case: 



• The decay length and decay length significance of the vertex with the largest 
decay length significance in three dimensions with respect to the IP; 

• The momentum \p\ of the set of tracks assigned to the decay chain (see 
below) ; 

• The pT-corrected vertex mass, calculated as described below; 

• The number A^trk,vtx of tracks in all non-primary vertices; 

• The secondary vertex probability of the tracks assigned to the decay chain; a 
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new vertex fit is performed using tliese tracks and the probability calculated 
from the fit ~x^] 

• The joint probability in and in R-z as described above. 




Fig. 8. Schematic showing the definition of distances L, D and T used in the selection 
of tracks for vertex mass determination. 

In addition to the general track selection for the calculation of the flavour tag 
variables, a special track selection is applied for the variables Mpt, \p\ and 
the secondary vertex probability as follows: the "seed vertex" , i.e. the vertex 
furthest away from the IP, is used to define the distances D between this 
vertex and the IP, and L as shown in Fig. [81 The "vertex axis" is the straight 
line connecting the seed vertex with the IP. For each track, the point of closest 
approach to the vertex axis is projected onto the vertex axis and L defined as 
the distance of the resulting point on the vertex axis from the IP. Tracks with 
0.18 < L/ D < 2.5 and a transverse distance T of the point of closest approach 
from the vertex axis below 1.0 mm are attached to the decay chain and used 
in the calculation. Note that in the default configuration all tracks from all 
vertices except the primary vertex are included automatically. Optionally, the 
track attachment cuts can be applied also to the tracks in the seed vertex. 

The momentum \p\ is the modulus of the vector sum of all decay chain track 
momenta. The secondary vertex probability is found by fitting a common 
vertex to these tracks and calculating the probability from the value of 
this fit in the same way as for the ZVKIN vertex finder, see Section 13.21 For 
the secondary vertex probability the number of tracks in the decay chain is 
required to exceed the value A^trks,min and the normalised fit-x^ is required 
to be below a user-settable value: x^/yriM < Xnormmax) where ndf is the 
number of degrees of freedom. For jets that do not meet these requirements, 
the probability is set to 0, to lower the risk of such jets leaking into the heavy 
flavour samples. 
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For the calculation of the Pr-corrected vertex mass Mpt, first the vertex mass 
before correction Mytx, is obtained from the decay chain tracks, assigning pion 
mass to each track. With ^vtx the angle between the seed vertex axis as given 
by the vertex position and the vertex momentum with respect to the IP, it is 
required that ■ (1 — cos^ 6'vtx) < u^pt,max ■ Afytx? where the factor ti'pt,max is a 
user-defined LCFI Vertex parameter of default value 3. This cut ensures that 
cases in which both ^vtx and |p| are large are excluded from the correction 
procedure to reduce the risk of fake vertices being assigned a large correction 
and subsequently affecting the flavour tag. Jets failing this cut are assigned 
an Mpt value of 0. A conservative estimate of the transverse momentum p^*^ 
corresponding to ^vtx and taking the error matrices of the seed axis and the 
IP into account, is obtained by iteratively minimising the correction term p^*^ 
and recalculating the seed axis direction. For this minimisation, the parameter 
Ncr,max determines the permitted extent of the seed axis correction in units of 
its uncertainty, larger values of A^o-,max permitting larger changes. The Pt- 



corrected vertex mass Mpt is then deflned as Mpt = y M^^^ + |p^tx|^ ^ |pVtx|^ 
Finally, it is required that the correction does not exceed the uncorrected value 
by a large factor, Mpt < Wcorr,max ■ ^vtx, where Wcorr,max is a code parameter. 

Figure [9] shows the flavour tag input variables used for jets for which more than 
one vertex was found, for b, c and light quark jets separately. Some of these 
variables, such as Mpt, already provide very good separation of the different 
jet flavours on their own, with correlations between the observables, exploited 
by the neural network approach, further improving the tagging performance. 

Some of the input variables of the flavour tag depend on the energy of the 
input jet. In order to be able to use the neural networks that are trained 
with jets from the e~^e~ ^ Z/'-f ^ qq events at the Z resonance for arbitrary 
energy, the momenta of the most and second-most signiflcant track, the decay 
length signiflcance and the seed vertex momentum are normalised to the jet 
energy before being fed into the neural nets. 



4-3 Combining flavour-sensitive variables using neural networks 

For heavy flavour tagging, that is the identiflcation of bottom and charm 
jets, neural networks are trained such that the target output provided in the 
training phase is 1 for signal jets and for background. The output value of 
a trained network will be the closer to 1 the more signal-like the values of the 
input observables. The LCFIVertex code is very flexible, permitting the use 
of different input variables, network architecture, node type, transfer function 
and training algorithm. 
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Fig. 9. Flavour tag inputs that are used if at least two vertices (at least one sec- 
ondary) are found in the jet. 
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Fig. 10. Output of the neural networks used for charm tagging. The plots show the 
outputs for the three separate networks used in case (a) one, (b) two and (c) three 
or more vertices are found in the input jet. In (d), the resulting distribution for 
arbitrary number of vertices is shown. 

By default, each of the networks is a muhilayer perception with 8 input nodes, 
one hidden layer of 14 tan-sigmoid nodes and one output node and is trained 
using the conjugate gradient back propagation algorithm. As explained in 
Section 14.31 the flavour tag is based on different observables for jets with one 
and for jets with two or more found vertices. Furthermore, for a given jet 
flavour, the distributions of sensitive variables are significantly different for 
jets with two and jets with three or more vertices, so the ability to distinguish 
between b and c jets is enhanced by treating these two cases separately. For 
each of these categories of one, two or at least three vertices, three networks 
are trained, so a complete set consists of nine networks altogether: for "6 nets" , 
the signal provided in the training phase consists of b jets while c and light 
flavour jets form the background. The "c nets" are trained with c jets as signal 
and b and light flavour jets as background. As for some physics processes the 
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background for the identification of c jets is known to consist of b jets only, 
and charm jets are easier to distinguish from these than from hght flavour 
jets, dedicated networks are provided for this case, which are only presented 
b jets as background in the training run. 




Fig. 11. Charm tag vs. bottom tag for input samples consisting purely of (a) bottom 
jets, (b) charm jets and (c) light quark jets. 

Figure [lUl shows the distribution of the output variables of neural networks 
used for tagging charm jets (c nets) separately for the cases of one, two and 
at least three vertices, and the combined distribution for an arbitrary number 
of vertices, for the sample of two-jet events at a/s = 91.2 GeV. The most 
straightforward way of using the charm tag (i.e. c net output) in an analysis 
is to require one or more jets in an event to have a charm tag exceeding a 
certain cut value, chosen as appropriate for the specific analysis. Resulting 
performance on a jet-by-jet basis is discussed in Section 14. 5[ Event selection 
can be improved by using information from both the charm and the bottom 
tag. This can, for example, be achieved by plotting charm versus bottom tag, 
as shown in Fig. [Tl]for bottom, charm and light flavour jets from the two-jet Z 
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peak sample, and placing a cut on the resulting two-dimensional distribution. 
Note that in the two-dimensional distribution for c jets, the peak near {b- 
tag = 0, c-tag = 0) stems from jets in which only the primary vertex was 
reconstructed, while the peak near (0, 1) is due to jets in which secondary 
vertices were also found. 



4-4 Functionality provided by the LCFIVertex neural net code 

The flavour tag described in the previous section is based on a neural network 
approach. Within the LCFIVertex package, neural network code implement- 
ing flexible multi-layer perceptrons is provided which was originally developed 
as a standalone package. In addition to the flavour tag processor already de- 
scribed, a dedicated Marlin processor is provided to train new networks. This 
can, for example, be used for training dedicated networks for specific analy- 
ses or detector geometries (although as a general rule the networks trained 
with two-jet events provide excellent tagging performance and are very widely 
usable). Furthermore, this processor provides an example of how a neural net- 
work can be set up and trained, for users who wish to change the flavour 
tagging approach, e.g. by using further input variables, different network ar- 
chitecture and/or training algorithms, or who would like to set up networks 
for new purposes such as the tagging of r leptons, currently not available in 
the LCFIVertex code. This section gives an overview of the functionality of 
the generic neural network code provided. 

In a multi-layer perceptron, the value Oj that is passed to the transfer function 
of neuron i is obtained from the weighted sum of the outputs tj from all 
neurons in the preceding layer, with the option to subtract a bias value Wib. 
O'i = SjLi tjWij — Wib- Four training algorithms to adjust the weights of the 
network are implemented, with the initial weights set to random values. The 
four training algorithms are: 

(1) the back propagation algorithm; 

(2) the back batch propagation algorithm; 

(3) the back propagation conjugate gradient (CG) algorithm; 

(4) the genetic algorithm. 

These are described in more detail in the literature on neural nets, for intro- 
ductions to the subject see e.g. Refs. [2SI2Z1- Algorithms 1 and 2 only differ in 
the number of training items processed per iteration; algorithm 3 uses a con- 
jugate gradient approach to minimise the error function by optimal choice of 
the two parameters (learning rate and momentum constant) that influence the 
weight changes at each training step. This algorithm converges considerably 
faster, by a factor of order 10, than the simple back propagation algorithm 
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and is used as the default training algorithm in the LCFIVertex package. Al- 
gorithm 4 is not based on error function minimisation, but during the training 
phase works with a set of neural networks, with the probability of one or more 
copies of a network being retained from one training step to the next being 
the larger the better a network performs on the training sample. 



4.5 Performance of the flavour tag 
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Fig. 12. Comparison of tagging performance at the Z resonance and at 
y/s = 500 GeV . Tagging purity is plotted as function of efficiency for b jets and 
c jets. Performance for c jets assuming only b background (labelled "be") is also 
shown. 

As a measure of the flavour-tagging performance, the purity of selecting bot- 
tom and charm jets is studied as function of the efficiency. Fig. [T2] shows purity 
vs. efficiency for the three tags provided by the LCFIVertex package, for the 
two-jet sample at i/i = 91.2 GeV and at y/s = 500 GeV. A flavour composi- 
tion of approximately 22 % (15 %) of bottom, 17 % (25 %) of charm and about 
61 % (60 %) of light flavour jets at = 91.2 GeV (500 GeV) is assumed. The 
plot is obtained by varying a simple cut on the neural net output variable 
and calculating purity and efficiency at each cut value. At the Z resonance, 
for the 6-tag a very pure sample, containing 92% b jets, can be selected at 
an efficiency of 70%. In comparison, high c-tag purities can only be achieved 
at lower efficiencies, mainly due to contamination from light flavour jets. The 
c-tag with all backgrounds included has been found to be the most sensitive 
of the tags whenever changing boundary conditions for the study, such as 
using different tracking algorithms, code parameters and detector geometry. 
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At 500 GeV, the 6-tag performance is degraded with respect to that at the Z 
resonance, while c-tag purity is very similar at both energies and the purity 
of the c-tag with only h background is slightly improved at the higher energy. 




FT iq HJ- FT iq 

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 

b-tag efficiency c-tag efficiency 



Fig. 13. Efficiencies for selecting jets with the wrong flavour when tagging (a) bottom 
jets and (h ) charm jets. 

A way of studying tagging performance that is independent of the sample 
composition is to look at the efficiency of selecting each of the "wrong" flavours 
when cutting on one of the tagging variables, i.e. the wrong flavour efficiencies. 
Figure [13] shows the c and light quark jet efficiencies as a function of the 6-tag 
efficiency and the h and light flavour jet efficiencies vs. the c-tag efficiency. 
It is clearly seen that the main background to the 6-tag is due to c jets. For 
the c-tag the main background at efficiencies above about 75 % is due to light 
flavour jets, while at lower efficiencies it is dominated by misidentifled h jets, 
as would be expected from the comparison of the purities for c-tag and c-tag 
with h background only. 

The relative importance of the input variables on the flavour tag results for 
the different contributing neural nets was determined using an estimator also 
used in the "Toolkit for Multivariate Data Analysis with ROOT" (TMVA) 
[28] . Following this approach, the input importance of variable i is deflned as 

rih ^ 

i=i 

where Xj is the average of the values of variable i in the input sample and 
the sum extends over the weights Wij corresponding to the connections of the 
neural network node of variable i with the nh nodes in the adjacent network 
layer. The calculation was implemented as part of the neural network code 
provided with LCFIVertex and this information is provided by the flavour tag 
processor as part of the run summary. Tables [H] and [7] summarise the results 
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1 vertex found by ZVTOP 


variable 


o-tag 


c-tag 


c-tag (b bgd.j 


most sigmi. track, ao/o'("oj 


0.002 lb 0.004 


0.005 zb 0.008 


0.000 ± 0.000 


2 "-most sigmi. track, ao/o"((Xoj 


0.058 lb 0.016 


0.009 zb 0.007 


0.026 ± 0.010 


most signif. track, ZQ/a{zQ) 


U.UUi ± O.OUi 


U.UOo ± U.UOz 


U.UUz ± U.UUI 


2'^'^-most signif. track, zo/a{zo) 


U.iio ± U.UoU 


U.UUO zh U.UUI 


U.UOD zh U.Ul 1 


most signif. track, |ptrk| 


0.113 ±0.005 


0.275 ± 0.015 


0.043 ± 0.002 


2"'*-most signif. track, |ptrk| 


0.062 ± 0.003 


0.068 ± 0.006 


0.048 ± 0.003 


joint probability, R-cp 


1 


0.676 ± 0.031 


0.887 ± 0.041 


joint probability, R-z 


0.922 lb 0.037 


1 


1 


2 vertices found by ZVTOP 


variable 


6-tag 


c-tag 


c-tag {b bgd.) 


Mpt 


1 


1 


1 


\P\ 


0.098 ± 0.007 


0.404 zb 0.029 


0.114 ±0.008 


decay length 


0.182 lb 0.013 


0.990 ± 0.072 


0.139 ±0.010 


decay length significance 


0.070 ± 0.008 


0.187 zb 0.022 


0.115 ±0.013 


-^trk,vtx 


0.063 lb 0.004 


0.162 zb 0.009 


0.081 ± 0.004 


secondary vertex probability 


0.230 lb 0.017 


0.212 zb 0.016 


0.124 ±0.010 


joint probability, R-cp 


0.040 zb 0.004 


0.071 zb 0.007 


0.052 ± 0.005 


joint probability, R-z 


0.061 ± 0.008 


0.159 zb 0.021 


0.052 ± 0.007 


3 vertices found by ZVTOP 


variable 


6-tag 


c-tag 


c-tag (6 bgd.) 


Mpt 


1 


1 


1 


\P\ 


0.490 lb 0.076 


0.960 zb 0.148 


0.580 ± 0.089 


decay length 


0.550 zb 0.119 


0.338 zb 0.072 


0.370 ± 0.080 


decay length significance 


0.081 ± 0.018 


0.106 ± 0.023 


0.078 ± 0.017 


-^trk.vtx 


0.665 ± 0.089 


0.811 ±0.108 


0.829 ±0.111 


secondary vertex probability 


0.087 ± 0.045 


0.112 ±0.057 


0.066 ± 0.034 


joint probability, R-(j) 


0.006 ± 0.002 


0.012 ± 0.005 


0.007 ± 0.003 


joint probability, R-z 


0.007 ± 0.003 


0.017 ±0.007 


0.009 ± 0.003 



Table 6 

Relative importance li/Imax of variables used as inputs for flavour tag neural nets 
at the Z resonance. 
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1 vertex found by ZVTOP 


variable 


6-tag 


c-tag 


c-tag (6 bgd.) 


most sigmi. track, ao/o'("oj 


0.363 ± 0.084 


1 


0.063 ± 0.014 


2 "-most sigmi. track, ao/o"((Xoj 


0.499 ± 0.079 


0.072 ± 0.020 


0.429 ± 0.067 


most signif. track, ZQ/a(zo) 


U.U3d ± O.OUo 


0.1d6 ± U.U53 


0.148 ± 0.033 


2'^'^-most signif. track, zo/a{zo) 


i 


U.Uo ( ± U.UiO 


i 


most signif. track, |ptrk| 


0.019 ± 0.004 


0.057 ±0.019 


0.013 ± 0.003 


2"'*-most signif. track, |ptrk| 


0.009 ± 0.002 


0.013 ± 0.005 


0.012 ± 0.003 


joint probability, R-cp 


0.364 ± 0.066 


0.304 ± 0.091 


0.564 ±0.102 


joint probability, R-z 


0.363 ± 0.061 


0.487 ±0.142 


0.688 ±0.116 


2 vertices found by ZVTOP 


variable 


6-tag 


c-tag 


c-tag (6 bgd.) 


Mpt 


1 


0.438 ± 0.116 


1 


\P\ 


0.071 ± 0.021 


0.128 ± 0.034 


0.082 ± 0.024 


decay length 


0.420 ±0.119 


1 


0.320 ± 0.091 


decay length significance 


0.011 ± 0.006 


0.013 ± 0.007 


0.019 ±0.011 


-^trk,vtx 


0.054 ±0.014 


0.061 ±0.013 


0.070 ± 0.018 


secondary vertex probability 


0.262 ± 0.078 


0.106 ±0.028 


0.141 ±0.042 


joint probability, R-cp 


0.083 ± 0.026 


0.064 ±0.018 


0.108 ± 0.034 


joint probability, R-z 


0.147 ±0.057 


0.168 ±0.061 


0.125 ± 0.049 


3 vertices found by ZVTOP 


variable 


6-tag 


c-tag 


c-tag (6 bgd.) 


Mpt 


0.500 ± 0.268 


0.814 ± 0.435 


0.745 ± 0.398 


\P\ 


0.173 ±0.096 


0.553 ± 0.307 


0.306 ± 0.169 


decay length 


1 


1 


1 


decay length significance 


0.013 ±0.010 


0.028 ± 0.021 


0.019 ±0.014 


-^trk.vtx 


0.418 ±0.172 


0.829 ± 0.341 


0.775 ± 0.318 


secondary vertex probability 


0.349 ± 0.504 


0.731 ± 1.050 


0.394 ± 0.569 


joint probability, R-(j) 


0.015 ±0.011 


0.048 ± 0.036 


0.028 ± 0.021 


joint probability, R-z 


0.035 ± 0.033 


0.127 ±0.123 


0.062 ± 0.061 



Table 7 

Relative importance li/Imax of variables used as inputs for flavour tag neural nets 
at 500 GeV. 

32 



obtained at the Z resonance and at 500 GeV, respectively, where for each 
neural network, values Jj are normalised to the maximum value /max- 

At the Z resonance, in case only the primary vertex is found by ZVTOP, the 
joint probability variables in and z provide the best handle for distin- 
guishing between different jet flavours. This is to be expected given that these 
variables combine information from all the tracks in the jet, rather than re- 
sulting from only one of them (as is the case for the other six inputs). For the 
c-tag provided for the case that all backgrounds are present, the momentum 
of the most significant track in the jet also contributes significantly to the 
flavour tag result. The other variables contribute to a much lesser extent. At 
500 GeV, the joint probability variables are still important, but the impact 
parameter signiflcances of the most and second most signiflcant track in the 
jet play a similarly important role in jet flavour identiflcation. 

If at least two vertices are found in a jet, the Prcorrected vertex mass pro- 
vides the clearest indication of the jet flavour. Other important variables are 
the seed vertex decay length, particularly if exactly two vertices are found, 
and the number of tracks in the seed vertex, especially if three or more ver- 
tices are found, as well as the vertex momentum |p| and the secondary vertex 
probability. As can be expected, the relative importance of the decay length 
increases with increasing jet energy, and surpasses that of the Mpt variable 
for the three- vertex case at 500 GeV. 



5 Quark charge determination 

5. 1 Quark charge for b and c jets 

Quark charge determination, i.e. reconstruction of the charge sign of the heav- 
iest quark in the leading hadron of a jet, will be important for a range of 
measurements at a Linear Collider. An example is the measurement of the 
left-right asymmetry Alr in e"'"e~ ^ '-f/Z ^ bb and e~^e~ ^ 'j / Z ^ cc, which 
is sensitive to new physics phenomena beyond the direct energy reach of the 
ILC and LHC, such as Z' exchange, leptoquarks, -R-parity-violating scalar 
particles and extra spatial dimensions [2^50] . Quark charge measurement can 
also help reduce combinatoric backgrounds in multi-jet events. 

The algorithm for reconstructing the quark charge differs for b and c jets. 
Flavour tagging is therefore a prerequisite for quark charge determination. 
Reconstruction is more challenging for b than for c jets, as b decay chains tend 
to be more complex than c decays. In a pure b jet sample, the b hadron is 
charged in about 40 % of the jets. For the cases in which a non-zero vertex 
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charge is found, the quark charge sign is given directly by the vertex charge. 



To calculate the vertex charge, all tracks that are thought to belong to the 
decay chain are identified and the vertex charge is given by the sign of the sum 
of their charges. The rare cases in which the vertex charge has an absolute 
value above 2, indicating imperfections in the assignment of tracks to the decay 
chain, are discarded in the determination of the quark charge sign. To assign 
tracks to the decay chain, initially the same approach is used as is described 
in Section for Mpt. In addition, if more than two vertices are found, the 
tracks contained in the secondary vertex are included in the Qsum calculation, 
even if they fail the L/D cut, which helps in the case of charm decays with 
a long decay length, corresponding to a large distance D of the charm decay 
from the IP and hence a comparatively small L/D value for the tracks from 
the B decay vertex. 

For c jets, the vertex charge is given by the charge sum, Qsum, in the same 
way as for h jets, with the same algorithm being used for the assignment of 
tracks to the decay chain as in the calculation of Mpt. The special procedure 
for the assignment of tracks from a secondary vertex as described above for h 
jets with more than two vertices is not applied, as there is no motivation for 
it for c jets, and it was found to degrade performance. 




-1 +1 +2 

Reconstructed vertex charge 



.2. 1000- 



800- 
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400 
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-2 -1 +1 +2 
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Fig. 14. Reconstructed vertex charge in comparison to the parton charge in (a) bot- 
tom jets and (h) charm jets, at the Z resonance. 



Figure [14] demonstrates the ability to distinguish the parton charge sign in 
heavy fiavour jets. No information about the parton charge is obtained for 
the case of a reconstructed vertex charge of zero, the dominant contribution 
to which is from vertices with exactly two tracks attached. Good charge sep- 
aration can be achieved by selecting jets with non-zero reconstructed vertex 
charge. Table [8] quantifies the performance of parton charge identification with 
standard LCFIVertex parameters. The ratio of correct over wrong charge iden- 
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Monte Carlo 

jet type 


Reconstructed vertex charge 




correct ambiguous wrong no tracks 


yi.zvjev jets 
c jets 


"iO Q'? 1 19 9 91 Q 
OZ.o OO.i iZ.Z zi.y 

14.6 27.4 3.2 54.8 


500 GeV 6 jets 

r jets 


27.0 23.9 13.5 35.6 

13.0 21.5 1.2 58.1 



Table 8 

Performance ofparton charge identification using LCFIVertex default settings. Non- 
zero vertex charge measurement with the same (opposite) sign as the parton charge 
is labelled "correct" ("wrong"). Jets with a reconstructed vertex charge of zero are 
called "ambiguous". Jets where no tracks pass the strict track quality criteria for 
the vertex charge measurement are listed in the rightmost column. 

tification is better for c jets than for b jets, and the reconstruction quahty is 
found to degrade shghtly with increasing coUision energy as expected. 



6 Summciry 

Precision measurements at the International Linear Collider will rely on ex- 
cellent vertexing capabilities of the vertex detector and the reconstruction 
software. The LCFIVertex software provides vertexing, flavour tagging and 
vertex charge reconstruction algorithms which are being used for optimisation 
of the ILC detectors in the current R&D phase of the project. 

Two vertexing algorithms are provided, which were originally developed at 
the SLD experiment: the ZVRES algorithm, being more generally applica- 
ble, and the ZVKIN algorithm, being specifically tailored to vertex finding 
for bottom jets. For a typical ILC detector design and using two-jet events 
at ^/s — 91.2 GeV, the ZVRES approach is expected to provide a secondary 
vertex finding efficiency of 90% for b jets. At 500 GeV, the assignment 
of tracks to vertices is affected by the increased level of final-state radiation 
and the decreased opening angles of jets. The LCFIVertex implementation 
of ZVRES takes the energy dependence into account in the definition of the 
vertex function, the mathematical representation of the jet topology. A dedi- 
cated study of how vertexing performance varies with jet energy could result 
in further improvements. 

The flavour tagging algorithm provided by the LCFIVertex package is based 
on the use of artificial neural networks. Different networks are used for bot- 
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torn and charm jets and for the cases that one, two or at least three vertices 
were found in the input jet. For two-jet events at the Z resonance, h jets can 
be selected with 90 % purity for an efficiency of 70%. At 500 GeV, the 
corresponding 6-tag purity is ~ 80 %. A method for monitoring the relative 
importance of the flavour tag input variables for each of the neural networks 
is provided with the code. Important input variables include vertex mass, 
momentum and decay length as well as the multiplicity of tracks assigned to 
vertices. Opportunities for future studies in this area include the change of pa- 
rameters used in the calculation of the current flavour tag inputs, the addition 
of further variables, change of neural network architecture and exploration of 
other classification approaches such as boosted decision trees. 



A Joint probability: parameters used in the calculation 



It follows from the definition of the joint probability Pj, see Section IT2l that 
it depends on the distribution of impact parameter significances of IP tracks. 
The function /(x) that approximates this distribution is determined from the 
data as follows: for IP tracks, the impact parameter significance distributions 
are symmetric. The shape of the distribution of positive impact parameters, 
used to find Pj, can therefore be determined by fitting the distribution of 
absolute values for tracks with negative impact parameter, corresponding to 
a very pure IP track sample. The distribution is approximated by fitting the 
sum of a Gaussian and two exponentials to the distribution, i.e. by finding 
parameters p/o, . . ., p/e such that 



f{A = Pfo ■ exp 1^-0.5 — — j j + exp + pj^x) + exp (p/5 + pjqx) 

describes the measured distribution. With this approximation, the integral 
Ib/'a f{x)dx can be written in the form 




+ pn (exp {-pnx) - exp {-pnh/ab)) 
+ Pn (exp {-pmx) - exp {-pub/cTb)) 
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where the integral over the Gaussian is cut off at (&/o"6)cut and the following 
parameter transformation is used: 



Pio 


= Pf2 




2 


Pn 








Pl2 






2 


Pn 








PlA 


= -Pf6 



exp {pfs/ {pfoPf2PfA)) 



In the code, the fit and parameter transformation are performed in the same 



parameter 


joint probability in R-cj) 


joint probability in z 


Pio 


0.843 


0.911 


Pii 


0.365 


0.306 


Pn 


0.620 


0.423 


Pn 


0.150 


0.139 


PIA 


0.029 


0.028 



Table A.l 

Parameters used in the calculation of the joint probability in R-(j) and in z, respec- 
tively, obtained from a fit to negative impact parameter distributions. 

processor. For the detector geometry LDCPrime_02Sc it yields the parameters 
listed in Table lATTl 



B Values of code parameters 



The values of the code parameters that were used to obtain the results pre- 
sented in this paper are listed in Table IB.ll 



C Versions of software packages used 



The version numbers of the various software packages used to obtain the results 
presented in this paper are listed in Table IC.li 
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ZVRES 


flavour tag 


vertex charge 


parameter 


value 


parameter 


value 


parameter 


value 


Wip 


i 







7qb,max (mm) 


i 


K 


U.LZO 


Ptrk,NL,min I'-^eVJ 


1 
i 


V-'^/^^qb,max 


Z.O 




U.D 


Ptrk,NL-l,min(,^e V j 


z 


l-^/-^)qb,min 


U.io 


ATRIM 


1 n 


AT" , 

-'''trks,min 


1 
i 


-tqc.max (mmj 


i 


Xo 


1 n 
iU 


Anorm,max 


on 


^ r / n^ 


z.o 






Tmax (mm) 


1 


(' r / n\ 

V ' /qc,min 


u.o 








Z.O 










(T lrt\ 


n 1 8 










N 


2 










■W^PT.max 


3 










^'Vurr.iiiax 


2 










{b/(Tb)cut 


200 







Table B.l 

Values of code parameters used for results presented in this paper. 



parameter 


version number 


Pythia 


6.4.10 


MOKKA 


06-06-p03 


Marlin 


vOO-10-03 


MarlinReco 


vOO-10-04 


PandoraPFA 


V02-02-02 



Table C.l 

Version numbers of software packages used in this paper. 
References 



[1] ILC Global Design Effort and World Wide Study, J. Brau (Ed.) et al., ILC 
Reference Design Report (RDR), SLAC-R-857 (2007). 

[2] T. Behnkc, S. Bertolucci, R. D. Heuer and R. Settles (Eds.), TESLA: The 
superconducting electron positron linear collider with an integrated X-ray laser 
laboratory. Technical design report. Pt. 4: A detector for TESLA, ECFA-2001- 
20. 

[3] The LHC/ILC Study group, G. Weiglein et al., Phys. Rept. 426 (2006) 47. 



38 



[4] available from LCFIVertex homepage 

http : / / ilcsof t . desy . de/portal/ sof tware_packages/lcf ivertex/. 

[5] Browser for Detector Models in the MOKKA Geometry Database, 

http : //www-f Ic . desy . de/ldcoptimization/tools/mokkamodels .php? 
model=LDCPrime_02Sc. 

[6] T. Sjostrand, S. Mrenna and P. Skands, JHEP 0605 (2006) 026, 
(arXiv:hep-ph/0603175) . 

[7] S. Agostinelh et al., Nucl. Instr. Meth. A 506 (2003) 250; J. AUison et al., IEEE 
Trans. Nucl. Sci. 53 (2006) 1. 

[8] P. Mora de Preitas, Mokka, Main Guidelines And Future, Prepared for 
International Conference on Linear Colliders (LCWS 04), Paris, France, 19-24 
Apr 2004. 

[9] F. Gaede, T. Behnke, N. Graf and T. Johnson, LCIO: A persistency framework 
for linear collider simulation studies. In the Proceedings of 2003 Conference 
for Computing in High-Energy and Nuclear Physics (CHEP 03), La Jolla, 
Cahfornia, 24-28 Mar 2003, pp TUKTOOl, ( |arXiv:physics/0306114D . 

[10] O. Wendt, F. Gaede and T. Kramer, Event reconstruction with MarlinReco at 



the ILC, arXiv:physics/0702171 



[11] F. Gaede, Nucl. lustrum. Meth. A 559 (2006) 177. 

[12] A. Raspereza, X. Chen, A. Frey, Proceedings of LCWS07, DESY, June 2007. 

[13] MarlinReco homepage 

http : //ilcsof t . desy . de/MarlinReco/vOO-lO-04/doc/htnil/index . html. 

[14] M. A. Thomson, Particle Flow Calorimetry at the ILC, Proceedings of LCWS06, 
Bangalore, March 2006, (arXiv:physics/0607261 ). 

[15] A. Raspereza, Proceedings of 2005 ALCPG and ILC workshops, Snowmass, 
U.S.A., econf/C0508141, (arXiv:physics /0601069 ) . 

[16] S. Yamashita, Attempt to Develop New Jet-Particle Association Methods for 
Four- Jet Topology with OPAL Detector, OPAL Technical Note TN579 (1998), 
unpublished. 

[17] D. J. Jackson, Nucl. lustrum. Meth. A 388 (1997) 247. 

[18] B. Jeffery, Flavour and charge identification using vertex information at the 
ILC, PhD thesis, Oxford University, UK (in preparation). 

[19] J. Thom, Search for B'^-B^ Oscillations with a Charge Dipole Technique at 
SLD, PhD thesis, SLAC-R-585 (2002). 

[20] W. H. Press, B. P. Flannery, S. A. Teukolski and W. T. Vetterling, Numerical 
Recipes in C: The Art of Scientific Computing, 2"^^ edition, Cambridge 
University Press, 1992. 



39 



[21] K. Abe et al., Nucl. Instrum. Meth. A 400 (1997) 287. 

[22] SLD Collaboration, T. Abe et al., Nucl. Instrum. Meth. A 447 (2000) 90. 

[23] R. Hawkings, Vertex detector and flavour tagging studies for the TESLA Uncar 
coHider, LC-PHSM-2000-021-TESLA. 

[24] C. Amsler et al., Particle Data Group, PL B667, 1 (2008), 
(http://pdg.lbl.gov). 

[25] ALEPH Collaboration, D. Buskulic et al., Phys. Lett. B 313 (1993) 535. 

[26] C. M. Bishop, Neural Networks for Pattern Recognition, Oxford University 
Press, 1995. 

[27] J. Hertz, A. Krogh and R. Palmer, Introduction to the theory of neural 
computation, Addison Wesley Publishing Company, 1991. 

[28] A. Hoecker, P. Speckmayer, J. Stelzer, F. Tegenfeldt, H. Voss, K. Voss, 
TMVA Toolkit for Multivariate Data Analysis with ROOT Users Guide, 
arXiv :physics/0703039[ 

[29] J. Hewett, Phys. Rev. Lett. 82 (1999) 4765. 

[30] S. Riemann, Fermion-Pair Production at a Linear Collider - A Sensitive Tool 
for New Physics Searches, LC-TH-2001-007. 



40 



